An Instance-Weighting Method to Induce Cost-Sensitive Trees
Abstract
We introduce an instance-weighting method to induce cost-sensitive trees. It is a generalization of the standard tree induction process where only the initial instance weights determine the type of tree to be induced: minimum error trees or minimum high cost error trees. We demonstrate that it can be easily adapted to an existing tree learning algorithm. Previous research provides insufficient evidence to support the idea that the greedy divide-and-conquer algorithm can effectively induce a truly cost-sensitive tree directly from the training data. We provide this empirical evidence in this paper. The algorithm incorporating the instance-weighting method is found to be better than the original algorithm in terms of total misclassification costs, the number of high cost errors, and tree size in two-class data sets. The instance-weighting method is simpler and more effective in implementation than a previous method based on altered priors.
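To make the idea of "only the initial instance weights determine the type of tree" concrete, here is a minimal Python sketch. The abstract does not give the weighting formula, so the scheme below is an assumption: each instance gets a weight proportional to the misclassification cost of its class, normalized so the weights sum to the number of instances. The names `initial_weights` and `weighted_entropy` are hypothetical helpers for illustration, not the paper's code.

```python
from collections import Counter
from math import log2


def initial_weights(labels, cost):
    """Weight each instance by its class's misclassification cost.

    Assumed scheme (not stated in the abstract): weights are
    proportional to cost[class] and rescaled to sum to len(labels).
    """
    counts = Counter(labels)
    n = len(labels)
    total = sum(cost[c] * counts[c] for c in counts)
    return [cost[y] * n / total for y in labels]


def weighted_entropy(labels, weights):
    """Class entropy computed from summed weights instead of raw counts."""
    totals = Counter()
    for y, w in zip(labels, weights):
        totals[y] += w
    s = sum(totals.values())
    return -sum((t / s) * log2(t / s) for t in totals.values() if t > 0)


labels = ["pos"] * 2 + ["neg"] * 8

# Equal costs: every weight is 1, so this is ordinary minimum-error induction.
uniform = initial_weights(labels, {"pos": 1, "neg": 1})

# Misclassifying "pos" is assumed to cost 4x as much as misclassifying "neg".
skewed = initial_weights(labels, {"pos": 4, "neg": 1})
```

With equal costs the weighted statistics reduce to plain counts. With unequal costs the rare, expensive class carries more total weight, so any split criterion computed from weighted class frequencies (such as the entropy above) treats it as more prominent, while the tree-growing procedure itself is unchanged.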
Similar resources
Evolutionary Induction of Cost-Sensitive Decision Trees
In the paper, a new method for cost-sensitive learning of decision trees is proposed. Our approach consists in extending the existing evolutionary algorithm (EA) for global induction of decision trees. In contrast to the classical top-down methods, our system searches for the whole tree at the moment. We propose a new fitness function which allows the algorithm to minimize expected cost of clas...
Constructing Cost Sensitive Decision Trees Based on Multi-Objective Optimization
We propose a multi-objective optimization based on the cost sensitive decision tree building method. The misclassification cost, test cost, waiting time cost and information gain rate as four optimization goals by using the method of linear weighting are adopted to transfer the multiobjective optimization problem into a single objective optimization problem, as the splitting attribute selection...
Ensemble Classification and Extended Feature Selection for Credit Card Fraud Detection
Due to the rise of technology, the possibility of fraud in different areas such as banking has been increased. Credit card fraud is a crucial problem in banking and its danger is over increasing. This paper proposes an advanced data mining method, considering both feature selection and decision cost for accuracy enhancement of credit card fraud detection. After selecting the best and most effec...
Tree-Approximations for the Weighted Cost-Distance Problem
We generalize the Cost-Distance problem: Given a set of sites in -dimensional Euclidean space and a weighting over pairs of sites, construct a network that minimizes the cost (i.e. weight) of the network and the weighted distances between all pairs of sites. It turns out that the optimal solution can contain Steiner points as well as cycles. Furthermore, there are instances where crossings opti...
Using a Relevance Model for performing Feature Weighting
Feature Weighting is one of the most difficult tasks when developing Case Based Reasoning applications. This complexity grows when dealing with ill-defined wide domains with a sparse case base. Moreover, most widely-used feature selection and feature weighting methods assume that features are either relevant in the whole instance space or irrelevant through-out. However, it is often the case th...
Journal title:
- IEEE Trans. Knowl. Data Eng.
Volume 14, Issue
Pages -
Publication date 2002